Semi-Autonomous Data Enrichment and Optimisation for Intelligent Speech Analysis
نویسنده
چکیده
Intelligent Speech Analysis (ISA) plays an essential role in smart conversational agent systems that aim to enable natural, intuitive, and friendly human computer interaction. It includes not only the long-term developed Automatic Speech Recognition (ASR), but also the young field of Computational Paralinguistics, which has attracted increasing attention in recent years. In real-world applications, however, several challenging issues surrounding data quantity and quality arise. For example, predefined databases for most paralinguistic tasks are normally quite small and few in number, which are insufficient for building a robust model. A distributed structure could be useful for data collection, but original feature sets are always too large to meet the physical transmission requirements, for example, bandwidth limitation. Furthermore, in a hands-free application scenario, reverberation severely distorts speech signals, which results in performance degradation of recognisers. To address these issues, this thesis proposes and analyses semi-autonomous data enrichment and optimisation approaches. More precisely, for the representative paralinguistic task of speech emotion recognition, both labelled and unlabelled data from heterogeneous resources are exploited by methods of data pooling, data selection, confidence-based semi-supervised learning, active learning, as well as cooperative learning. As a result, the manual work for data annotation is greatly reduced. With the advance of networks and information technologies, this thesis extends the traditional ISA system into a modern distributed paradigm, in which Split Vector Quantisation is employed for feature compression. Moreover, for distant-talk ASR, Long Short-Term Memory (LSTM) recurrent neural networks, which are known to be well-suited to context-sensitive pattern recognition, are evaluated to mitigate reverberation. The experimental results demonstrate that the proposed LSTM-based feature enhancement frameworks prevail over the current state-of-the-art methods.
منابع مشابه
Design and Implementation of an Intelligent Part of Speech Generator
The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...
متن کاملModeling and Intelligent Control System Design for Overtaking Maneuver in Autonomous Vehicles
The purpose of this study is to design an intelligent control system to guide the overtaking maneuver with a higher performance than the existing systems. Unlike the existing models which consider constant values for some of the effective variables of this behavior, in this paper, a neural network model is designed based on the real overtaking data using instantaneous values for variables. A fu...
متن کاملIntelligent Auto pilot Design for a Nonlinear Model of an Autonomous Helicopter by Adaptive Emotional Approach
There is a growing interest in the modeling and control of model helicopters using nonlinear dynamic models and nonlinear control. Application of a new intelligent control approach called Brain Emotional Learning Based Intelligent Controller (BELBIC) to design autopilot for an autonomous helicopter is addressed in this paper. This controller is applied to a nonlinear model of a helicopter. This...
متن کاملComposite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملIntelligent Resource Exchanges: Solutions and Pathways in a Workforce Allocation Problem
This paper considers the problem of resource allocation in the service industries approached from an agent-based perspective. Agent technologies seem to be well suited to this domain by providing a distributed environment, are network centric, semi-autonomous and collaborative and can communicate with each other to achieve better optimisation with little human intervention. The paper describes ...
متن کامل